PPMexe: PPM for Code Compression

نویسندگان

  • Milenko Drinić
  • Darko Kirovski
چکیده

With the emergence of software delivery platforms such as Microsoft’s .NET, code compression has become one of the core enabling technologies strongly affecting system performance. In this paper, we present compression mechanisms for executables that explore their syntax and semantics to achieve superior compression rates. The fundament of our compression codec is the generic paradigm of prediction by partial matching (PPM). We combine PPM with two pre-processing steps: (i) instruction rescheduling to improve prediction rates and (ii) partitioning of a program binary. We improve the traditional PPM algorithm by (iii) using an additional alphabet of frequent variable-length super-symbols extracted from the input stream of fixed-length symbols. (iv) a low-overhead mechanism that enables decompression starting from an arbitrary instruction of the executable, a feature pivotal for run-time software delivery. The compression algorithm is implemented for x86 binaries and tested on several large Microsoft applications. Binaries compressed using the PPM algorithm enhanced with (i -iv), are 16-23% smaller than files created using off-the-shelf PPMD, the best available compressor.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PPMexe: PPM for Compressing Software

With the emergence of software delivery platforms such as Microsoft’s .NET, code compression has become one of the core enabling technologies strongly affecting system performance. In this paper, we present PPMexe a set of compression mechanisms for executables that explores their syntax and semantics to achieve superior compression rates. The fundament of PPMexe is the generic paradigm of pred...

متن کامل

Generic Adaptive Syntax-Directed Compression for Mobile Code

We propose a new scheme for compressing mobile programs. Our proposal is meant as part of a larger infrastructure for code distribution and deployment. In this paper we show how to effectively compress programs on the source level by compressing abstract syntax trees (ASTs) which are equivalent to source code (modulo comments and layout). We compress ASTs by adapting the wellknown PPM (predicti...

متن کامل

Text Compression using Recency Rank with Context and Relation to Context Sorting, Block Sorting and PPM*

Recently block sorting compression scheme was developed and relation to statistical scheme was studied, but theoretical analysis of performance has not been studied well. Context sorting is a compression scheme based on context similarity and it is regarded as an online version of the block sorting and it is asymptotically optimal. However, the compression speed is slower and the real performan...

متن کامل

PPM Performance with BWT Complexity: A New Method for Lossless Data Compression

This work combines a new fast context-search algorithm with the lossless source coding models of PPM to achieve a lossless data compression algorithm with the linear context-search complexity and memory of BWT and Ziv-Lempel codes and the compression performance of PPM-based algorithms. Both sequential and nonsequential encoding are considered. The proposed algorithm yields an average rate of 2...

متن کامل

Design and Analysis of Fast Text Compression Based on Quasi-Arithmetic Coding

We give a detailed algorithm for fast text compression. Our algorithm, related to the PPM method, simpli es the modeling phase by eliminating the escape mechanism and speeds up coding by using a combination of quasi-arithmetic coding and Rice coding. We provide details of the use of quasi-arithmetic code tables, and analyze their compression performance. Our Fast PPM method is shown experimenta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002